Fuzzy relational thesauri in information retrieval: automatic knowledge base expansion by means of classified textual data
Authors
Abstract
In our ongoing project we are developing a tool that gives domain engineers a facility for creating fuzzy relational thesauri (FRT) describing subject domains. The resulting fuzzy relational thesauri can serve as a knowledge base for an intelligent information agent answering user queries relevant to the described domains, or for textual searching on the web. However, creating (fuzzy) thesauri manually is quite a tedious process when the data source from which the domain engineer selects concepts and instances for the thesaurus is not well organized or structured, which is typically the case with textual databases. To ease the FRT creation process, we use a small starting FRT together with our text categorization technique, which temporarily expands the FRT during the supervised learning phase of text categorization. This by-product of categorization is then used to enlarge the final FRT automatically or semi-automatically.
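The idea of a weighted thesaurus that is enlarged from categorized text can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the class `FuzzyRelationalThesaurus`, the function `propose_candidates`, the thresholds, and the document-frequency weighting are hypothetical and are not the categorization technique described in the abstract, whose details are not given here.

```python
from collections import defaultdict


class FuzzyRelationalThesaurus:
    """Toy FRT (hypothetical): directed term relations with weights in [0, 1]."""

    def __init__(self):
        # relations[source][target] = strength of the relation
        self.relations = defaultdict(dict)

    def add_relation(self, source, target, weight):
        if not 0.0 <= weight <= 1.0:
            raise ValueError("weight must lie in [0, 1]")
        # Keep the strongest weight seen for a given pair.
        current = self.relations[source].get(target, 0.0)
        self.relations[source][target] = max(current, weight)

    def expand(self, query_terms, threshold=0.5):
        """Return the query terms plus related terms at or above the threshold."""
        expanded = {term: 1.0 for term in query_terms}
        for term in query_terms:
            for related, weight in self.relations.get(term, {}).items():
                if weight >= threshold:
                    expanded[related] = max(expanded.get(related, 0.0), weight)
        return expanded


def propose_candidates(labelled_docs, min_weight=0.6):
    """Assumed heuristic: weight a term by the fraction of a category's
    documents that contain it. labelled_docs is a list of (category, text)."""
    docs_per_category = defaultdict(int)
    docs_with_term = defaultdict(lambda: defaultdict(int))
    for category, text in labelled_docs:
        docs_per_category[category] += 1
        for term in set(text.lower().split()):
            docs_with_term[category][term] += 1

    candidates = []
    for category, counts in docs_with_term.items():
        for term, count in counts.items():
            weight = count / docs_per_category[category]
            if weight >= min_weight:
                candidates.append((category, term, weight))
    return sorted(candidates)
```

In a semi-automatic setting, the candidates returned by `propose_candidates` would be shown to the domain engineer, and only the accepted ones passed to `add_relation`; in an automatic setting they would be added directly.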
Similar resources
Fuzzy relational thesauri expansion using categorized textual data
In our ongoing project we develop a tool which provides domain engineers with a facility to create fuzzy relational thesauri (FRTi) describing subject domains. The created FRTi can be used as knowledge base for an intelligent information agent when answering user queries relevant to the described domains, or for textual searching on the web. However, the manual creation of thesauri is quite ted...
Creation and Maintenance of Query Expansion Rules
In an information retrieval system, a thesaurus can be used for query expansion, i.e. adding words to queries in order to improve recall. We propose a semi-automatic and interactive approach for the creation and maintenance of domain-specific thesauri for query expansion. Domain-specific thesauri are especially required in highly technical domains where the use of general thesauri for query exp...
User Comprehension and Searching with Information Retrieval Thesauri
While information retrieval thesauri may improve search results, there is little research documenting whether general information system users employ these vocabulary tools. This article explores user comprehension and searching with thesauri. Data were gathered as part of a larger empirical query-expansion study involving the ProQuest Controlled Vocabulary. The results suggest that users’ kno...
Query Architecture Expansion in Web Using Fuzzy Multi Domain Ontology
As the web keeps growing, establishing a general framework for data mining and for retrieving structured data from the Web poses many challenges. Creating an ontology is a step towards solving this problem. The ontology raises the main entity and the concept of any data in data mining. In this paper, we propose a method for applying the "meaning" of the search system, but the problem ...
Similarity Thesauri and Cross-Language Retrieval
This paper describes a method for constructing a thesaurus automatically from a corpus of suitable documents, using standard information retrieval methods. The resulting thesauri can be used for user-initiated query expansion, automatic query expansion, as well as cross-language retrieval. Researchers at the Swiss Federal Institute of Technology in Zürich developed and evaluated this method in ...